On the doubt about margin explanation of boosting
نویسندگان
چکیده
Margin theory provides one of the most popular explanations to the success of AdaBoost, where the central point lies in the recognition that margin is the key for characterizing the performance of AdaBoost. This theory has been very influential, e.g., it has been used to argue that AdaBoost usually does not overfit since it tends to enlarge the margin even after the training error reaches zero. Previously the minimum margin bound was established for AdaBoost, however, Breiman [10] pointed out that maximizing the minimum margin does not necessarily lead to a better generalization. Later, Reyzin and Schapire [34] emphasized that the margin distribution rather than minimum margin is crucial to the performance of AdaBoost. In this paper, we show that previous margin bounds are special cases of the kth margin bound, and none of them is really based on the whole margin distribution. Then, we improve the empirical Bernstein bound given by Maurer and Pontil [28]. Based on this result, we defend the margin-based explanation against Breiman’s doubt by proving a new generalization error bound that considers exactly the same factors as Schapire et al. [35] but is uniformly tighter than Breiman [10]’s bound. We also provide a lower bound for generalization error of voting classifiers, and by incorporating factors such as average margin and variance, we present a generalization error bound that is heavily related to the whole margin distribution. Finally, we provide empirical evidence to verify our theory.
منابع مشابه
A Refined Margin Analysis for Boosting Algorithms via Equilibrium Margin
Much attention has been paid to the theoretical explanation of the empirical success of AdaBoost. The most influential work is the margin theory, which is essentially an upper bound for the generalization error of any voting classifier in terms of the margin distribution over the training data. However, important questions were raised about the margin explanation. Breiman (1999) proved a bound ...
متن کاملOn the Margin Explanation of Boosting Algorithms
Much attention has been paid to the theoretical explanation of the empirical success of AdaBoost. The most influential work is the margin theory, which is essentially an upper bound for the generalization error of any voting classifier in terms of the margin distribution over the training data. However, Breiman raised important questions about the margin explanation by developing a boosting alg...
متن کاملMargin Distribution Controlled Boosting
Schapire’s margin theory provides a theoretical explanation to the success of boosting-type methods and manifests that a good margin distribution (MD) of training samples is essential for generalization. However the statement that a MD is good is vague, consequently, many recently developed algorithms try to generate a MD in their goodness senses for boosting generalization. Unlike their indire...
متن کاملBoosting the margin: A new explanation for the effectiveness of voting methods
One of the surprising recurring phenomena observed in experiments with boosting is that the test error of the generated classifier usually does not increase as its size becomes very large, and often is observed to decrease even after the training error reaches zero. In this paper, we show that this phenomenon is related to the distribution of margins of the training examples with respect to the...
متن کاملBoosting as a Regularized Path to a Maximum Margin Classifier
In this paper we study boosting methods from a new perspective. We build on recent work by Efron et al. to show that boosting approximately (and in some cases exactly) minimizes its loss criterion with an l1 constraint on the coefficient vector. This helps understand the success of boosting with early stopping as regularized fitting of the loss criterion. For the two most commonly used criteria...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Artif. Intell.
دوره 203 شماره
صفحات -
تاریخ انتشار 2013